High GC content causes orphan proteins to be intrinsically disordered

Authors

  • Walter Basile
  • Oxana Sachenkova
  • Sara Light
  • Arne Elofsson
Abstract

De novo creation of protein-coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should, for instance, not aggregate. Therefore, although the creation of short ORFs could be truly random, their fixation should be subject to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila, young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge, no valid explanation for this difference has been proposed. To solve this riddle, we studied the structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in properties between proteins of different ages. However, when we take GC content into account, we find that it can explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder-promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead, these proteins largely resemble random proteins given a particular GC level. During evolution, the properties of a protein change faster than the GC level, causing the relationship between disorder and GC to gradually weaken.
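The link between GC level and disorder-promoting residues in random sequences can be illustrated with a small calculation. The sketch below is not the authors' method. It assumes a simple null model in which each codon position is drawn independently, with G and C each at probability GC/2 and A and T each at (1 - GC)/2. It also assumes one commonly used classification of disorder-promoting amino acids (A, R, G, Q, S, P, E, K). The function names and this residue set are illustrative choices, not taken from the paper.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: one commonly used set of disorder-promoting residues.
# Other scales exist; this set is not taken from the paper.
DISORDER_PROMOTING = set("ARGQSPEK")


def base_probability(base: str, gc: float) -> float:
    """Probability of one base when G+C together have frequency `gc`."""
    return gc / 2 if base in "GC" else (1 - gc) / 2


def expected_disorder_fraction(gc: float) -> float:
    """Expected share of disorder-promoting residues in a random ORF.

    Codons are weighted by the product of their base probabilities;
    stop codons are excluded and the rest renormalised.
    """
    if not 0.0 <= gc <= 1.0:
        raise ValueError("gc must be between 0 and 1")
    sense_total = 0.0
    disorder_total = 0.0
    for codon, aa in CODON_TABLE.items():
        if aa == "*":  # skip stop codons
            continue
        p = 1.0
        for base in codon:
            p *= base_probability(base, gc)
        sense_total += p
        if aa in DISORDER_PROMOTING:
            disorder_total += p
    return disorder_total / sense_total


if __name__ == "__main__":
    for gc in (0.3, 0.5, 0.7):
        print(f"GC={gc:.1f}  disorder-promoting fraction="
              f"{expected_disorder_fraction(gc):.3f}")
```

Under these assumptions the expected fraction rises with GC, because the GC-rich codon families (GCN, GGN, CCN, CGN) encode Ala, Gly, Pro and Arg, while many AT-rich codons encode order-promoting residues such as Phe, Ile, Tyr and Asn. Real genomes deviate from this model through codon bias and selection, so the numbers are only indicative.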


Similar resources

High GC Content Causes De Novo Created Proteins to be Intrinsically Disordered

De novo creation of protein coding genes involves formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. De novo created proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not cause aggregation. Therefore, although the creation of the short ORFs could be truly random, but the...


Inferring Function Using Patterns of Native Disorder in Proteins

Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein function. In this study, we directly address the potential contribution of protein disorder in p...


Content of intrinsic disorder influences the outcome of cell-free protein synthesis

Cell-free protein synthesis is used to produce proteins with various structural traits. Recent bioinformatics analyses indicate that more than half of eukaryotic proteins possess long intrinsically disordered regions. However, no systematic study concerning the connection between intrinsic disorder and expression success of cell-free protein synthesis has been presented until now. To address th...


Molecular signaling involving intrinsically disordered proteins in prostate cancer

Investigations on cellular protein interaction networks (PINs) reveal that proteins that constitute hubs in a PIN are notably enriched in Intrinsically Disordered Proteins (IDPs) compared to proteins that constitute edges, highlighting the role of IDPs in signaling pathways. Most IDPs rapidly undergo disorder-to-order transitions upon binding to their biological targets to perform their functio...


Functional fragments of disorder in outer membrane β barrel proteins

The traditional view of "sequence-structure-function" has been amended by the discovery of intrinsically disordered proteins. Almost 50% of PDB structures are now known to have one or more regions of disorder, which are involved in diverse functions. These regions typically possess low aromatic content and sequence complexity as well as high net charge and flexibility. In this study, we examine...



Journal title:

Volume 13, Issue

Pages -

Publication date: 2017